Abstract 



The Monge-Kantorovich transportation problem involves optimiz- 
ing with respect to a given a cost function. Uniqueness is a fun- 
damental open question about which little is known when the cost 
function is smooth and the landscapes containing the goods to be 
transported possess (non-trivial) topology. This question turns out to 
be closely linked to a delicate problem 111) of Birkhoff p3]: give 
{N| a necessary and sufficient condition on the support of a joint prob- 

(3J[) ability to guarantee extremality among all measures which share its 

marginals. Fifty years of progress on Birkhoff 's question culminate in 
Hestir and Williams' necessary condition which is nearly sufficient for 
\Q extremality; we relax their subtle measurability hypotheses separating 

necessity from sufficiency slightly, yet demonstrate by example that 
i— —i to be sufficient certainly requires some measurability. Their condition 

amounts to the vanishing of the measure 7 outside a countable alter- 
PH nating sequence of graphs and antigraphs in which no two graphs (or 
t£h two antigraphs) have domains that overlap, and where the domain 

of each graph / antigraph in the sequence contains the range of the 
£3 succeeding antigraph (respectively, graph). Such sequences are called 

numbered limb systems. We then explain how this characterization can 
t-h be used to resolve the uniqueness of Kantorovich solutions for optimal 

transportation on a manifold with the topology of the sphere. 
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1 Introduction 

This survey weaves together two themes: the first is Monge's 1781 prob- 
lem [81] of transporting mass from a landscape X to a landscape Y so 
as to minimize the average cost c(x, y) per unit transported; the second 
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is Birkhoff 's 1948 problem [T4] of characterizing extremality among doubly 
stochastic measures on the unit square. 

The first problem has become classical in the calculus of variations; it 
has deep connections to analysis [98] [73] [73] [30] [51], geometry [83] [29] 
[95] [71] [70] pH] [79] [63], dynamics [7J [TO] [5H] [H2] and nonlinear partial 
differential equations [H] [20] [21] [36] [32] [72] [102], as well as applications 
in physics [3E] [97] [75], statistics [88], engineering [15] p] [57] [EE] [TUT] , 
atmospheric modeling [33] [87] [35] [33], and economics [23] [25] [31] [27] 
|49j . The second is a problem in functional analysis, at the junction between 
measure theory and convex geometry. It is not evident that either involves 
differential topology. 

The two problems are linked by Kantorovich's reformulation of Monge's 
nonlinear minimization as an (infinite-dimensional) linear program [60] |61j . 
In this framework, existence of solutions became straightforward for any con- 
tinuous cost c G C(X x Y). Still, fifty more years would elapse before the 
optimal volume-preserving map between two arbitrary domains sought by 
Monge was constructed for the Euclidean distance c(x, y) — \x — y\ in [3] [22] 
and [99J . Evans and Gangbo had already solved the analogous problem with 
the domains replaced by disjoint Lipschitz continuous probability densities 
[32], while Sudakov's earlier construction [96j required a claim which turned 
out to be true only two in dimensions [3] [12] ; see [23] [26] [12] for simplifica- 
tions and [33] [6] [9] j37J for extensions. Uniqueness fails in this context [53] 
|45j . In the meantime both Monge and Kantorovich problems were found to 
enjoy unique solutions for strictly convex costs such as c(x,y) — \x — y\ p /p, 
with p = 2[I7J[T8][ID]!32]andp>l[T9][^ general crite- 

rion for existence and uniqueness of optimal maps was identified by Gangbo 
[52] and Levin |66J, building on works of those cited above. For any pair of 
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destinations y\ ^ y 2 in Y, it prohibits the function 

x G X i — > c(x, yi) - c(x, y 2 ) (1) 

from having critical points on X. Strictly convex functions c(x, y) = h(x — y) 
on X = Y = R n [19] [53] satisfy this condition — called the twist criterion 
in [104] - but no differentiable cost c G C 1 (X x Y) satisfies it on any 
compact manifold X without boundary. Although terrestrial transportation 
takes place on the sphere, there are few theorems set in topologies other than 
the ball — not to speak of the more exotic landscapes which arise naturally 
in some applications. Spherical examples typically show that uniqueness of 
Kantorovich solutions holds even though Monge solutions fail to exist |55j. 
Building on these developments, one of the goals of this article is to expose a 
criterion for uniqueness of Kantorovich solutions which works equally well on 
the sphere and the ball [27] . Called the subtwist by Chiappori McCann and 
Nesheim, this criterion depends on the Morse structure of the cost globally: 
it permits the function ([I]) to have up to two critical points on X - - a 
unique global minimum and a unique global maximum. Unfortunately, this 
cannot be satisfied in more exotic topologies such as the /c-holed torus (k > 
1), where uniqueness remains a tantalizing open question. Our discussion 
is predicated on global differentiability of the cost, since a wide variety of 
existence and uniqueness results concerning optimal solutions to the Monge- 
Kantorovich problem have been established for costs with singular sets - 
including distances in Riemannian [2S] [77] [IB], sub-Riemannian [5] [2] [20] 
and Alexandrov [TT] spaces, and the mechanical actions arising from Tonelli 
Lagrangians [9 J [43J. 

The proof that the subtwist condition is sufficient for uniqueness relies on 
progress in Birkhoff's problem of characterizing extremal doubly stochastic 
measures on the square. This problem is esoteric and subtle: although still 
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not completely resolved, substantial results have been obtained in the six 
decades since it was posed [2H] [SI] [SH] [S] [58J. Highlights are surveyed 
below. 

The literature surrounding Birkhoff's problem is modest, compared to 
the recent explosion of research on the Monge-Kantorovich transportation 
problem. We expect the main interest of this article will therefore lie in its 
connection to the latter developments. For simplicity of exposition, however, 
we postpone a further description of these connections to section [5] The 
earlier sections are devoted to Birkhoff's problem and the issues surrounding 
it. Although less familiar than the Monge-Kantorovich theory to most of 
our readership, the developments surveyed are elementary yet powerful; they 
require nothing more sophisticated than measure theory to discuss. Readers 
in need of motivation — or those interested primarily in optimal transporta- 



tion — are encouraged to skip directly to Theorem 5T for a preview of the 
intended application. 

2 Extremal doubly stochastic measures 

An n x n doubly stochastic matrix refers to a matrix of non-negative entries 
whose columns and rows each sum to 1. The doubly stochastic matrices form 
a convex subset of all n x n matrices — in fact a convex polytope, whose 
extreme points are in bijective correspondence with the n! permutations on 
n-letters, according to Birkhoff p3] and von Neumann [105J. For example, 
the 3x3 doubly stochastic matrices, 

s t 1 — s — t 

u v 1 — u — v 

1 — s — u 1—t—v s+t+u+v—1 
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form a 4-dimensional polytope with 6 vertices. Shortly after proving this 
characterization, Birkhoff [HI Problem 111] initiated the search for a infinite- 
dimensional generalization, thus stimulating a line of research which remains 
fruitful even today. 

A doubly stochastic measure on the square refers to a non-negative Borel 
probability measure on [0, l] 2 whose horizontal and vertical marginals both 
coincide with Lebesgue measure A on [0,1]. The set of doubly stochastic 
measures forms a convex set we denote by T(A, A) (which is weak-* compact 
in the Banach space dual to continuous functions C([0, l] 2 ) normed by their 
suprema || • \\<x,)- A measure is said to be extremal in T(A, A) if it cannot be 
decomposed as a convex combination 7 = (1 — t)j + £71 with < t < 1 and 
< 7o 7^ 7i £ r(A, A). Since the Krein-Milman theorem asserts that convex 
combinations of extreme points are dense (in any compact convex subset of 
a topological vector space, Figure [T]), it is natural to want to characterize 
the extreme points of T(A, A). Another motivation for such a characteriza- 
tion is that every continuous linear functional on T(A, A) is minimized at an 
extreme point. Whether or not this extremum is uniquely attained can be 
an interesting question, as in the optimal transportation context: in Figure 
[T]the horizontal coordinate is minimized at a single point but maximized at 
two extreme points (and along the segment joining them). 

Motivated by the optimization problems already mentioned, we prefer to 
formulate the question in slightly greater generality, by replacing the two 
copies of ([0, 1], A) with probability spaces (A, /1) and (Y, z/), where X and Y 
are each subsets of a complete separable metric space, and \l and v are Borel 
probability measures on X and Y respectively. This widens applicability of 
the answer to this question without increasing its difficulty. Letting r(/i, v) 
denote the Borel probability measures onIx7 having fi and v for marginals, 
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Figure 1: Krein-Milman asserts a compact convex set K can be reconstructed 
from its extreme points (denoted here by solid circles • and solid lines — ). 

we wish to characterize the extreme points of the convex set T(fi, u). Ideally, 
as in the finite-dimensional case, this characterization would be given in 
terms of some geometrical property of the support of the measure 7 in X x Y . 
Indeed, if \i = YmLi m i^xi and v = Y^=i n j^yj are finite, our problem reduces 
to characterizing the extreme points of the convex set A of m x n matrices 
with prescribed column and row sums: 

n m 

A = {ciij > I rrii = 2J %, 2J a ij = n A- 

j=i i=i 

A matrix (a^) is well-known to be extremal in A if and only if it is acyclic, 
meaning for every sequence a il j 1 , . . . , a ik j k of non-zero entries occupying k > 2 
distinct columns and k distinct rows, the product . . . a ik _ 1 j k a ik j 1 must 
vanish — see Figure [2] or Denny [37] , where the terminology aperiodic is 
used. Similarly, a set S C X x Y is acyclic if for every k > 2 distinct 
points {xi, . . . , Xk} C X and {yi, . . . ,y k } C Y, at least one of the pairs 
(xi2/i), (x 1 ,y 2 ), (22,2/2), • • • 5 ( x k-i,Vk), (xk,Vk), (xk,yi) lies outside of S. 

A functional analytic characterization of extremality was supplied by 
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Figure 2: In an acyclic matrix the product of x's and o's must vanish. 



Douglas [39] and by Lindenstrauss [67J: it asserts that 7 is extremal in v) 
if and only if L l {X, dp) © L?-(Y, dv) is dense in L X (X x Y, d^y). Although this 
result is a wonderful starting point, it is not quite the characterization we 
desire for applications, since it is not easily expressed in terms of the geom- 
etry of the support of 7. Significant further progress was made by Benes 
and Stepan, who showed every extremal doubly stochastic measure vanishes 
outside some acyclic subset S C X x Y [8] . Hestir and Williams refined this 
condition, showing that it becomes sufficient under an additional Borel mea- 
surability hypothesis which, unfortunately, is not always satisfied [58] . Some 
of the subtleties of the problem were indicated already by Losert's counterex- 
amples [69J. The difficulty of the problem resides partly in the fact that any 
geometrical characterization of optimality must be invariant under arbitrary 
measure-preserving transformations applied independently to the horizontal 
(abscissa) and vertical (ordinate) variables. 

In the next two sections we review this line of research, clarifying the 
nature of the gap separating necessity from sufficiency and pointing out that 
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it can be narrowed slightly by replacing the Borel cr-algebra with suitably 
adapted measure-completions. We give a self-contained proof of that part of 
the theory which is needed to resolved the uniqueness of optimal transporta- 
tion with respect to a smooth cost on the sphere. This application was first 
developed in an economic context by Chiappori, McCann, and Nesheim [27J, 
and forms the subject of the final section of the present manuscript. 

3 Measures on graphs are push-forwards 

Before recalling the characterization of interest, let us develop a bit of nota- 
tion in a simpler setting, and a key argument that we shall require. Impatient 
or knowledgeable readers can proceed directly to the final sections below, re- 
ferring back to the present section only as needed. 

Let X and Y be subsets of complete separable metric spaces, and fix a 
non-negative Borel measure \x on X. Suppose / : X — > Y is /i-measurable, 
meaning f~ l (B) is in the cx-algebra completion of the Borel subsets of X 
with respect to the measure fi, whenever B is relatively Borel in Y. Then a 
Borel measure on Y is induced, denoted /#// and called the push-forward of 
\i through /, and given by 

{Un)[B]:=n[r\B)\ (2) 

for each Borel B C Y . Defining the projections tv (x, y) = x and n Y (x, y) — y 
on X x Y, this notation permits the horizontal and vertical marginals of a 
measure 7 > on X x Y to be expressed as 7r^7 and 71^7 respectively. 

The next lemma shows that any measure supported on a graph can be 
deduced from its horizontal marginal. It improves on Lemma 2.4 of |55J and 
various other antecedents, by using an argument from Villani's Theorem 5.28 
1 104] to extract /z-measurability of / as a conclusion rather that a hypothesis. 
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As work of, e.g., Hestir and Williams [HS] implies, although measures on 
graphs are extremal in r(/x, u), the converse is far from being true; this 
peculiarity is an inevitable consequence of the infinite divisibility of (X,/i). 

Lemma 3.1 (Measures on graphs are push- forwards) Let X and Y be 

subsets of complete separable metric spaces, and 7 > a a-finite Borel 
measure on the product space X x Y. Denote the horizontal marginal of 
7 by n := vr^7. If 7 vanishes outside the graph of f : X — > Y, meaning 
{(x,y) G X x Y I y 7^ f(x)} has zero outer measure, then f is fi-measurable 
and 7 = (idx x /)#/■*; where idx x / denotes the map x G X 1 — > (x, f(x)) G 
X xY. 

Proof. Since outer-measure is subadditive, it costs no generality to as- 
sume the subsets X and Fare in fact complete and separable, by extending 7 
in the obvious (minimal) way. Any a-finite Borel measure 7 is regular and o- 
compact on a complete separable metric space; e.g. p. 255 of [40J or Theorem 
1-55 of [103J. Since 7 vanishes outside Graph(/) := {(x, f(x)) \ x G X}, there 
is an increasing sequence of compact sets Ki C K i+ i C Graph(/) whose union 
= lim^oo Ki contains the full mass of 7. Compactness of Ki C Graph(/) 
implies continuity of / on the compact projection Xi := n x (Ki). Thus 
the restriction f^ of / to Xoo := ^(Koo) is a Borel map whose graph 
Koo = Graph (/oo) is a a-compact set of full measure for 7. We now ver- 
ify that 7 and (idx^ x /oo)#A i assign the same mass to each Borel rectangle 
UxV C 1x7. Since (Ux V)nGraph(/ 00 ) = ((C/n/- 1 ^)) x K)nGraph(/ 00 ) 
we find 

7 (l/xV) = 7 ((t/n/ oo 1 0/))xY) 

= /i^n/^ 1 ^)), 
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proving 7 = (idx x x /oo )#/•*• Taking U = X\ X^ and V = K shows X \ X^ 
is /^-negligible. Since idx x / differs from the Borel map idx x x only 
on the /^-negligible complement of the u-compact set Xoo, we conclude / is 
/i-measurable and 7 = (idx x as desired. ■ 

The preceding lemma shows that any measure concentrated on a graph is 
uniquely determined by its marginals; 7 is therefore extremal in T(ir^'y } 7r #7)- 
As the results of the next section show, the converse is far from being true. 

4 Numbered limb systems and extremality 

In this section we adapt Hestir and Williams [58J notion of a numbered limb 
system — also called an axial forest or a limb numbering system — to X xY. 
Using the axiom of choice, Hestir and Williams deduced from the acyclicity 
condition of Benes and Stepan [8] that each extremal doubly stochastic mea- 
sure vanishes outside some numbered limb system. Conversely, they showed 
that vanishing outside a numbered limb system is sufficient to guarantee 
extremality of a doubly stochastic measure, provided the graphs (and anti- 
graphs) comprising the system are Borel subsets of the square. Our main 
theorem gives a new proof of this converse in the more general setting of sub- 
sets X x Y of complete separable metric spaces, and under a slightly weaker 
measurability hypothesis on the graphs and antigraphs. A simple example 
shows that some measurability hypothesis is nevertheless required. In the 
next and final section, we shall see how this converse relates to the question 
of uniqueness in optimal transportation. 

Given a map / : D — > Yaa.Dc X, we denote its graph, domain, range, 
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and the graph of its (multivalued) inverse by 

Graph(/) := {(x, f(x)) \ x G D}, 

Dom/:= 7r x (Graph(/)) = D, 

Ran/:= tt y (Graph(/)), 

Antigraphy) := {(f(x),x) \ x G Dom/} CY x X. 

More typically, we will be interested in the Antigraph(g) C X x Y of a map 
g : D CY — ► X. 
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Figure 3: The subsets Ik need not be connected; in this numbered limb 
system they are represented as connected sets for visual convenience only. 

Definition 4.1 (Numbered limb system) Let X and Y be Borel subsets 
of complete separable metric spaces. A relation S C XxYis a numbered limb 
system if there is a countable disjoint decomposition of X = and of 

Y = U^ /2i with a sequence of maps f 2 i : Dom(/ 2 j) C Y — > X and / 2 i+i : 
Dom(/ 2i+1 ) C X — > Y such that S = Graph(/ 2i _i) U Antigraphy), 
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with Dom(/ fe ) U Ran(/ fe+1 ) C Ik for each k > 0. The system has (at most) 
N limbs i/Dom(/ fc ) = /or a// k > N. 

Notice the map f is irrelevant to this definition though I is not; we 
may always take Dom(/ ) = 0, but require Ran(_/i) C I . The point is 
the following theorem and its corollary, which extends and relaxes the result 
proved by Hestir and Williams for Lebesgue measure \i = v = A on the 
interval X = Y = [0, 1]. In it, v) denotes the set of non-negative Borel 
measures onIx7 having /i = ir^'-f and v = 71^7 for marginals. As in the 
preceding lemma, we say 7 vanishes outside of S C X x Y if 7 assigns zero 
outer measure to the complement of S in X x Y. 

Theorem 4.2 (Numbered limb systems yield unique correlations) 

Let X and Y be subsets of complete separable metric spaces, equipped with 
a -finite Borel measures fi on X and v onY . Suppose there is a numbered 
limb system S = U"^ Graph(/ 2 j-i) U Antigraph(/ 2 i) with the property that 
Graph(/ 2 i-i) and Antigraph(/ 2 j) are ^-measurable subsets of X xY for each 
i > 1 and for every 7 G T(fi,u) vanishing outside of S. If the system has 
finitely many limbs or /i[X] < 00, then at most one 7 G v) vanishes 
outside of S. If such a measure exists, it is given by 7 = Y^k=i ^ k w ^ere 



Here fk is measurable with respect to the rjk completion of the Borel a-algebra. 
If the system has N < 00 limbs, 7^ = for k > N, and rjk and 7^ can be 
computed recursively from the formulae above starting from k = N . 

Proof. Let S = Graph(/ 2i _i) U Antigraph(/ 2i ) be a numbered limb 
system whose complement has zero outer measure for some a-finite measure 



7 2i _i = (id x x / 2 j_i)#?7 2 j-i, 7 2i = (f 2 i x id Y )#r] 2 . 



(3) 
(4) 
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< 7 £ r(/i, v). This means that Ik D Dom gives a disjoint decomposition 
of X = U^ J 2 i + i and of Y— U°l I 2i , and that Ran(/ fc ) C Ik-i for each k > 1. 
Assume moreover, that Graph(/ 2 i) and Antigraph(/ 2 i-i) are 7- measurable 
for each i > 1. We wish to show 7 is uniquely determined by /x, z/ and S\ 

The graphs Graph(/2i-i) are disjoint since their domains I 2 i-x are dis- 
joint, and the antigraphs Antigraphy*) are disjoint since their domains I 2 % 
are. Moreover, Graph(/2i-i) is disjoint from Antigraphy.?) for all i,j > 1: 
Ran(/ 2 i-i) C hi-2 prevents Graph(/ 2 i-i) from intersecting Antigraph(/ 2 j-2) 
unless j = i since the domains hj-2 are disjoint, and Graph(/ 2 i-i) can- 
not intersect Antigraph(/ 2 «-2) since Dom(/ 2 i-i) C hi-i is disjoint from 
Ran(/ 2 i-2) C his- 

Let 7fc denote the restriction of 7 to Antigraphy) for k even and to 
Graph(/fc) for k odd. Then 7 = Yllk by our measurability hypothesis, 
and 7fc restricts to a Borel measure on X x Dom/fc if k is even, and on 
Dom/fc x Y if k odd. Defining the marginal projections /if. = ir^k and 
Uk = vr^7fc, setting r/k = Vk if k even and r\k = if k odd yields (3) and 
the ?7fc-measurability of fk immediately from Lemma |3.1| Since u 2 i vanishes 
outside Dom/a, from v = Y^T=i u k we derive u 2i = (u - J2 k ^ 2i 2^)|Dom/ 2i - 
For k even, Vk vanishes outside Dom fk C Ik, while for k odd, Vk vanishes 
outside Ran/fc C Ik-i, which is disjoint from Domfa unless k — 2i + 1. 
Thus ?72i = — ^2i+i)|Dom/ 2l - The formula (El) for 7721-1 follows from similar 
considerations. 

It remains to show the representation (|3j) — specifies (jkiVk) uniquely 
for all k > 1, and hence determines 7 = J^7fc uniquely. If the system has 
N < 00 limbs, Ik = for k > N and hence 7^ = 0. We can compute i]k 
and 7fc starting with k = N, and then recursively from the formulae above 
for k = N — 1,N — 2, . . . , 1, so the formulae represent 7 uniquely. If instead 
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S has countably many limbs, suppose there are two finite Borel measures 7 
and 7 vanishing outside of S and having the same marginals \i and v. For 
each k > 1, recall that 



K, 



Graph(/fc) k odd, 
Antigraphy) k even, 



is measurable with respect to both 7 and 7- Given e > 0, take N large enough 
so that both 7 and 7 assign mass less than e to \J^L N Kk. Set 7^ = 7^ 
and 7fc = 7|x fe and denote their marginals by (^k,^k) — (^#7fc,^#7fc) an d 
(A**., Pfe) = (7r|7 fc , 7r^7 fc ). Observe that both 7 e := Y^k=i 7fc and T : = SfcLi 7fc 
are concentrated on the same numbered limb system; it has finitely many 
limbs, and the differences 5p, e = ^2 k=1 (ftk — fJ>k) an d & V<L — J2k=i(^ k ~~ u k) 
between the marginals of 7 e and 7 e have total variation at most 2e. Since 
the <5/i 2 j_i = p>2i-i — A*2i-i are mutually singular, as are the bv^i = v 2 i — z/ 2 j, 
we find the sum of the total variations of 



5r)k ■-- 



fj, k — ii k k odd, 
\>k — Vk k even, 



is bounded: Y,k=i ll<%i|TV(Dom/ fc ) < 4e. Using (3) to derive 

11- _ || _ / \\(idx x fk)#8Vk\\TV(XxY) k odd, 

117. lk\\TV { xxY) - I \\(f k x zd Y ) # 6 Vk \\ TV{XxY) k even, 

= ||^fc||TV(Dom/ fc ) 

and summing on k yields ||7 e — YWtv(XxY) < 4e. Since 7 e — )■ 7 and 7 e — ^ 7 
as e — » 0, we conclude 7 = 7 to complete the uniqueness proof. ■ 

As in Hestir and Williams [HE], the uniqueness theorem above implies 
extremality as an immediate consequence. 

Corollary 4.3 (Sufficient condition for extremality) Let X and Y be 

subsets of complete separable metric spaces, equipped with a-finite Borel mea- 
sures fi on X and v on Y '. Suppose there is a numbered limb system S = 
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Graph(/ 2 j~i) U Antigraph(/ 2 i) with the property that Graph (/ 2 j_i) and 
Antigraph(/ 2 j) ore ^-measurable subsets of X x Y for each i > 1, for every 
7 G r(/i, v) vanishing outside of S . If the system has finitely many limbs or 
lx[X] < oo, then any measure 7 G r(/i, v) vanishing outside of S is extremal 
in the convex set r(/x, v). 

Proof. Suppose a measure 7 G r(/i, z/) vanishes outside a numbered limb 
system S satisfying the hypotheses of the corollary. If 7 = (1—^)70 + ^71 with 
70,71 G v) and < t < 1, then 7 > 70 and 7 > 71, so both 70 and 71 



vanish outside of S. According to Theorem |4.2[ they are uniquely determined 
by 5* and their marginals, hence 70 = 71 to establish the corollary. ■ 

The following example confirms that a measurability gap still remains 
between the necessary and sufficient conditions for extremality. It is a close 
variation on the standard example of a non-Lebesgue measurable set from 
real analysis. Together with the lemma and theorem preceding, this example 
makes clear that measurability is required only to allow the graphs to be 
separated from each other and from the antigraphs in an additive way. 

Example 4.4 (An acyclic set supporting non-extremal measures) 

Let A denote Lebesgue measure and define the maps fo{x) = x and f\(x) = 
x + a/2 (mod 1) on the unit interval X — Y — [0, 1]. Notice Graph(/j) C 
[0, l] 2 supports the doubly stochastic measure 7^ = (id x fi)#\ for i = and 
i = 1; (both measures are extremal in T(A, A) by Corollary Irrationality 
of a/2 implies S = Graph(/o) U Graph (/1) is an acyclic set, hence can be 
expressed as a numbered limb system according to Hestir and Williams [58]. 
On the other hand, there are doubly stochastic measures such as 7 := ^(70 + 
71) which vanish outside of S but which are manifestly not extremal. 
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5 Uniqueness of optimal transportation 



In this section we apply the foregoing results to the uniqueness question for 
optimal transportation on manifolds, which arises when one wants to use a 
continuum of sources to supply a continuum of sinks (modeled by \i and v 
respectively) as efficiently as possible. 

Given subsets X and Y of complete separable metric spaces equipped 
with Borel probability measures, representing the distributions \i of produc- 
tion on X and v of consumption on Y, the Kantorovich-Koopmans [60] [65] 
transportation problem is to find 7 G v) correlating production with 
consumption so as to minimize the expected transportation cost 



against some continuous function c G C(X x Y). Hereafter we shall be 
solely concerned with the case in which X is a differentiable manifold, /x is 
absolutely continuous with respect to coordinates on X, and the cost function 
c G C 1 (X x Y) is differentiable with local control on the magnitude of its x- 
derivative d x c(x, y) uniformly in y; for convenience we also suppose Y to be a 
differentiable manifold and c is bounded, though this is not really necessary: 
substantially weaker assumptions also suffice [21]; c -f- [HI] [HE] [IB]- 

In this setting one immediately asks whether the infimum ^ is uniquely 
attained. Since attainment is evident, the question here is uniqueness. If c 
satisfies a twist condition, meaning x G X 1 — > c(x, yi)—c(x, y 2 ) has no critical 
points for y\ ^ y 2 G Y, then we shall see that not only is the minimizing 
7 unique, but its mass concentrates entirely on the graph of a single map 
fx : X — > Y (a numbered limb system with one limb), thus solving a form 
of the transportation problem posed earlier by Monge [81J [61J. This was 
proved in comparable generality by Gangbo [52] and Levin [66] (see also 



inf / 

7er(M,i>) JxxY 




(5) 
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Ma, Trudinger and Wang [72]), building on the more specific examples of 
strictly convex cost functions c(x,y) = h(x — y) in X = Y = R n analyzed 
by Caffarelli [IE], Gangbo and McCann [53] \J2\, Riischendorf [89] [90] and 
in case h(x) = \x\ 2 by Abdellaoui and Heinich [1], Brenier [17] [18], Cuesta- 
Albertos, Matran, and Tuero-Diaz [31] [32], Cullen and Purser [31] [35] [87], 
Knott and Smith [HI] [93], and Riischendorf and Rachev [91]. Adding further 
restrictions beyond this twist hypothesis allowed Ma, Trudinger, Wang [72] 
[100] . and later Loeper [55] . to develop a regularity theory for the map /i : 
X — )• F, embracing Delanoe [36], Caffarelli [20] [21] and Urbas' [102] results 
for the quadratic cost, Gangbo and McCann's for its restriction to convex 
surfaces [55], and Wang's for reflector antenna design [106] . which involves 
the restriction of c(x, y) — — log \x—y \ to the sphere [57] [107] . Unfortunately, 
the twist hypothesis, also known as a generalized Spence-Mirrlees condition 
in the economic literature, cannot be satisfied for smooth costs c on compact 
manifolds X x Y, and apart from the result we are about to discuss there 
are no general theorems which guarantee uniqueness of minimizer to (g in 
this context. With this in mind, let us state our main theorem, a version of 
which was established in a more complicated economic setting by Chiappori, 
Nesheim, and McCann [27]. The streamlined formulation and argument 
given below should prove more interesting and accessible to a mathematical 
readership. 

Theorem 5.1 (Uniqueness of optimal transport on manifolds) 

Let X and Y be complete separable manifolds equipped with Borel probability 
measures fi on X and v onY . Let c G C X (X xY) be a bounded cost function 
such that for each y\ ^ y2 G Y, the map 




(6) 
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has no critical points, save at most one global minimum and at most one 
global maximum. Assume d x c(x,y) is locally bounded in x, uniformly in Y. 
If \x is absolutely continuous in each coordinate chart on X , then the mini- 
mum is uniquely attained; moreover, the minimizer 7 G r(/i, v) vanishes 
outside a numbered limb system having at most two limbs. 

Proof. We first prove that there is a numbered limb system having at 
most two limbs, outside of which the mass of all minimizers 7 vanishes. A 
detailed argument confirming the plausible fact that the graphs of these limbs 
are Borel subsets of X x Y will be given later. Uniqueness of 7 then follows 
from Theorem 14.21 

By linear programming duality (due to Kantorovich and Koopmans in 
this context), it is well-known |104] that there exist upper semi-continuous 
potentials q G L l (X, dfx) and r G L l (Y, dv) with 

q(x) = inf c(x,y) - r(y) (7) 

such that 

inf / c(x,y)d-y(x,y) = / q(x)dfi(x) + / r{y)dv{y). (8) 

7GlV,i/) JxxY JX JY 

From ([7]) we see 

c(x,y)-q(x)-r(y)>0, (9) 

and let 

Z := {{x, y) G X x Y\c(x, y) - q(x) - r{y) = 0} (10) 

denote the set where the non-negative function c(x, y) —q{x) —r(y) vanishes. 
Lower semi-continuity of this function implies Z is a closed subset of X x Y. 
Notice that (j8l) implies any minimizer 7 G T(/i, v) vanishes outside the zero 
set Z C X x Y of the non-negative function appearing in Q. It remains 
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to show this set Z is contained in a numbered limb system consisting of at 
most two limbs (apart from a fi £g> v negligible set). 

From (m), q is locally Lipschitz, since d x c(x,y) is controlled locally in x, 
independently of y G Y. Rademacher's theorem therefore combines with 
absolute continuity of \x to imply q is differentiable //-almost everywhere; we 
can safely ignore any points in X where differentiability of q fails, since they 
constitute a set of zero volume: 7[DomDg x Y] — fi[DomDq] = 1. Taking 
xo G DomDg, suppose (xo,yi) and (£0,2/2) both lie in Z, hence saturate the 
inequality Then d x c(xo, yi) = Dq(xo) = d x c(xo, 7/2)- In case the cost 
is twisted, meaning ^ has no critical points, we conclude y\ = y 2 hence 
Z PI (DomDg x Y) is contained in a graph. This completes the proofs by 
Gangbo and Levin of existence (and uniqueness) of a solution y\ = fi(xo) 
to Monge's problem, pairing almost every xq G X with a single y\ G y. 



Notice uniqueness follows from Lemma 3^ without further measurability 
assumptions. 

In the present setting, however, we only know that Xq must be a global 
minimum or global maximum of the function ([6]). Exchanging y\ with y<i if 
necessary yields 

q(x) < c(x, yi) - r(yi) < c(x, y 2 ) - r(y 2 ) (11) 
for all x G X, the second inequality being strict unless x = xq, in which 



case both inequalities are saturated. Strictness of inequality (11) implies 
{x,y 2 ) $l Z unless x = xq. In other words, (x,y 2 ) G Z lies on the antigraph 
of a function f 2 (y 2 ) = xo well-defined at y 2 . There may or may not be a 
point yo G Y different from y\ such that 

q(x) < c(x, y ) - r(y ) < c(x, y x ) - r{y x ) (12) 

for all x G X. If such a point y Q exists, then (xo,yi) G Antigraph(/ 2 ) as 
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above. If no such y exists, setting fi(x ) := y% yields Z R (DomDg x Y) C 
Graph(/i) U Antigraphy). Since the range of f\ is disjoint from the domain 
of f 2 , this completes the proof that — up to 7-negligible sets — Z lies in a 
numbered limb system with at most two limbs, as desired. 

Let us now prove Borel measurability of these limbs. To do this, we define 
the cross-difference as in McCann |76|, 



A(x, y, x\ y') := c(x, y) + c(x', y') - c(x, y') - c(V , y) 

which is a continuous function on (X x Y) 2 and notice that A < on Z 2 , 
i.e, any two (x,y) and (x',y') in Z satisfy 

A(x,y,x',y') < 

This well-known fact [M] can be deduced by summing the inequalities 

< c(x', y) — q(x') — r(y) = c(x', y) — q(x') + q(x) — c(x, y) 
< c(x, y') — q(x) — r(y') = c(x, y') — q(x) + q(x') — c(x', y'). 

Closedness of Z and a-compactness of B = X x Y imply 

h{x x ,y x ) := inf A(x 1 ,y 1 ,x,y 2 ) 

{(x,y2)eXxY\(x 1<y2 )GZ} 



is Borel on X x Y, according to Lemma 5.2 below. Taking y 2 = y\ implies 
h < on Z. A point (xi,yi) G Z is said to be marked if x± G X and 
h{x u yx) = 0, i.e 

c(x, yi) - c(x, y 2 ) < c(xi, y x ) - c{x 1: y 2 ) (13) 
for all x G X and (xi,y 2 ) G Z. This definition is equivalent to saying that 



there is no yo satisfying (12) , i.e , the set of marked points in Zf] (DomDq x 
Y) is equal to Graph^). This implies Graph(/x) = (DonrDg x F) fi Z fl 
{(x, y)\h(x, y) = 0} hence is a Borel subset of X x Y. Borel measurability of 
{h > 0} and hence Antigraphy) also follows. ■ 
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Lemma 5.2 Let A and B be topological spaces and Z C A x B be closed. If 
g : A x B — > RU { — 00} is lower semi- continuous, and B is o -compact, then 
the following function h is Borel: 

h(a) := inf q(a, b). 

{b£B\{a,b)£Z} 

Proof. See Lemma A.4 of |27]. ■ 

Let us conclude by recalling an example of an extremal doubly stochas- 
tic measure which does not lie on the graph of a single map, drawn from 
work of Gangbo and McCann [55] and Ahmad [3J on optimal transporta- 
tion, and developed in an economic context by Chiappori, McCann, and 
Nesheim [27]. Other examples may be found in the work of Seethoff and 
Shiflett |92j, Losert [69j, Hestir and Williams [5B], Gangbo and McCann [54], 
Uckelmann [M], McCann [75], and Plakhov [go] . 

Imagine the periodic interval X = Y = H/2ttZ = [0,2n[ to parame- 
terize a town built on the boundary of a circular lake, and let probability 
measures fi and v represent the distribution of students and available places 
in schools, respectively. Suppose the distribution of students is smooth and 
non-vanishing but peaks sharply at the northern end of the lake, and the 
distribution of schools is smooth, non- vanishing and sharply peaked at the 
southern end of the lake. If the cost of transporting a student residing at 
location 9 G [0, 2tt] to school at location <fi G [0, 2tt] is presumed to be given 
in terms of the angle commuted by c(9, <ft) = 1 — cos(6* — <p), the most effective 
pairing of students with places in schools is given by the measure in r(/i, v) 
which attains the minimum: 

min / c(M)<*7(M)- (14) 
According to results of Gangbo and McCann [55] which are generalized 



in Theorem 5.1, this minimizer is unique, and its support is contained in the 
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union of the graphs of two maps t : X — > Y. A schematic illustration is 
given in Figure [4j where the restriction of the support to the subsets marked 
by ± on the flat torus XxY represent graph(t + ) and graph(t~) respectively. 
The dotted lines mark 0-0 = ±§,±^. The necessary positivity of ^[Jx x 
Jyi] > in this picture may be explained by observing that although it 
is cost-effective for all students to attend a school where they live, this is 
incompatible with the concentration of students at the north end of the lake, 
and of schools at the south end. Once this imbalance is corrected by sending 
a sufficient number of northern students to southern schools by the map t , 
the remaining students can be assigned to school near their home using the 
map t + . Continuity of both of these maps is established in [55J and further 
quantified by McCann and Sosio |78j, and McCann, Pass and Warren [80J. 
Periodicity of graphs on the flat torus can be used to represent the support as 
a numbered limb system in more than one way; see Figure [5j which exploits 
the fact that the support of 7 in Figure [4] intersects X x Jy 2 in a graph and 
X x (Y — Jyi) in an anti-graph. 

Chiappori, Nesheim and McCann [27J called the uniqueness hypothesis 
limiting the number of critical points to at most one maximum and at most 
one minimum in ^ the subtwist condition. Although it is satisfied in the 
example above, it is an unfortunate fact that the subtwist condition cannot 
be satisfied by any smooth function c(9, 4>) on a product of manifolds X xY 
with more complicated Morse structures than the sphere. It is an interesting 
open problem to find a criterion on a smooth cost c(6, <fi) on X = Y = R 2 /Z 2 



which guarantees uniqueness of the minimum (14) for all smooth densities 
fj, and v on the torus. Although we expect such costs to be generic, not a 
single example of such a cost is known to us. Hestir and Williams criteria for 
extremality seems likely to remain relevant to such questions, and it is natural 
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to conjecture that the complexity of the Morse structure of the manifold X 
plays a role in determining the required number of limbs in the system. 




6 Epilog 



The connections of optimal transportation to geometry and curvature - 
sectional [62] [68], Ricci [29] [70] [71] [79] [83] [95] [HI], and mean [63] - 
have become abundantly clear in recent years. Connections to differential 
topology remained largely unsuspected. The results reviewed above highlight 
the delicacy of identifying the extremality of a doubly stochastic measure 
from its support, and the role played by critical points of the transportation 
cost ^ in guaranteeing the uniqueness of the extremal measure 7 6 r(/i, u) 
which solves a Kantorovich transportation problem (14) set on the ball or 
sphere X. When the sources /1 are continuously distributed, the topology 
of the landscape X limits the support of 7 to lie on a graph in the case of 
a ball, and a numbered limb system with two limbs in the case of a sphere. 
This characterization is dimension independent. For landscapes with more 
complicated topology, not a single example of a cost function c G C 1 (X x 
Y) is known to guarantee uniqueness of optimal measure for all continuous 
densities /1 and v — nor is anything known about the support of 7 beyond 
its numbered limb system structure and the local rectifiability determined 
by the rank of the cost 



References 

[1] T. Abdellaoui and H. Heinich. Sur la distance de deux lois dans le cas 
vectoriel. C.R. Acad. Sci. Pans Ser. I Math. 319 (1994) 397-400. 

[2] A. Agrachev and P.W.Y. Lee. Optimal transportation under non- 
holomic constraints. Trans. Amer. Math. Soc. 361 (2009) 6019-6047. 



25 



[3] N.Ahmad. The geometry of shape recognition via a Monge-Kantorovich 
optimal transport problem. PhD thesis, Brown University, 2004. 

[4] L. Ambrosio. Lecture notes on optimal transport problems. In Math- 
ematical Aspects of Evolving Interfaces, volume 1812 of Lecture Notes 
in Mathematics, pages 1-52. Springer, Berlin, 2003. 

[5] L. Ambrosio and S. Rigot. Optimal transportation in the Heisenberg 
group. J. Fund. Anal. 208 (2004) 261-301. 

[6] L. Ambrosio, B. Kirchheim, and A. Pratelli. Existence of optimal trans- 
port maps for crystalline norms. Duke Math. J. 125 (2004) 207-241. 

[7] L.A. Ambrosio, N. Gigli, and G. Savare. Gradient flows in metric spaces 
and in the space of probability measures. Lecture Notes in Mathematics 
ETH Zurich. Birkhauser Verlag, Basel, 2005. 

[8] V. Benes and J. Stepan. The support of extremal probability measures 
with given marginals. In M.L. Puri, P. Revesz and W. Wertz, editors, 
Mathematical Statistics and Probability Theory. Volume A: Theoretical 
Aspects, pages 33-41. D. Reidel Publishing Co., Dordrecht, 1987. 

[9] P. Bernard and B. Buffoni. The Monge problem for supercritical Mane 
potentials on compact manifolds. Adv. Math. 207 (2006) 691-706. 

[10] P. Bernard and B. Buffoni. Optimal mass transportation and Mather 
theory. J. Eur. Math. Soc. (JEMS) 9 (2007) 85-121. 

[11] J. Bertrand. Existence and uniqueness of optimal maps on Alexandrov 
spaces. Adv. Math. 219 (2008) 838-85. 



26 



[12] S. Bianchini and F. Cavalletti. The Monge prob- 

lem for distance cost in geodesic spaces. Preprint at 

http : //cvgmt . sns . it/papers/biacav09/Monge@problem.pdf . 

[13] G. Birkhoff. Three observations on linear algebra. Univ. Nac. Tucumn. 
Revista A 5 (1946) 147-151. 

[14] G. Birkhoff. Lattice Theory. Revised Edition, volume 25 of Colloquium 
Publications. American Mathematical Society, New York, 1948. 

[15] G. Bouchitte, G. Buttazzo, and P. Seppecher. Shape optimization 
solutions via Monge-Kantorovich equation. C.R. Acad. Sci. Paris Ser. 
7 324 (1997) 1185-1191. 

[16] G. Bouchitte, W. Gangbo and P. Seppecher. Michell trusses and lines 
of principal action. Math. Models Methods Appl. Sci. 18 (2008) 1571- 
1603. 

[17] Y. Brenier. Decomposition polaire et rearrangement monotone des 
champs de vecteurs. C.R. Acad. Sci. Paris Ser. I Math. 305 (1987) 
805-808. 

[18] Y. Brenier. Polar factorization and monotone rearrangement of vector- 
valued functions. Comm. Pure Appl. Math. 44 (1991) 375-417. 

[19] L. Caffarelli. Allocation maps with general cost functions. In P. Mar- 
cellini et al, editors, Partial Differential Equations and Applications, 
number 177 in Lecture Notes in Pure and Appl. Math., pages 29-35. 
Dekker, New York, 1996. 

[20] L.A. Caffarelli. The regularity of mappings with a convex potential. J. 
Amer. Math. Soc. 5 (1992) 99-104. 



27 



[21] L.A. Caffarelli. Boundary regularity of maps with convex potentials - 
II. Ann. of Math. (2) 144 (1996) 453-496. 

[22] L.A. Caffarelli, M. Feldman and R.J. McCann. Constructing optimal 
maps for Monge's transport problem as a limit of strictly convex costs. 
J. Amer. Math. Soc. 15 (2002) 1-26. 

[23] L. Caravenna. An existence result for the Monge problem in R n with 



norm cost. Preprint at http://hdl.handle.net/1963/3647 



[24] G. Carlier. A general existence result for the principal-agent problem 
with adverse selection. J. Math. Econom. 35 (2001) 129-150. 

[25] G. Carlier and I. Ekeland. Structure of cities. J. Global Optim. 29 
(2004) 371-376. 

T. Champion and L. De Pascale. The Monge problem in H d . Preprint at 



http : //cvgmt . sns . it/papers/chadep09/champion-depascale . pdf 



[27] P.-A. Chiappori, R.J. McCann, and L. Nesheim. Hedonic price equilib- 
ria, stable matching and optimal transport: equivalence, topology and 
uniqueness. Econom. Theory 42 (2010) 317-354. 

[28] D. Cordero-Erausquin. Sur le transport de mesures periodiques. C.R. 
Acad. Sci. Paris Ser. I Math. 329 (1999) 199-202. 

[29] D. Cordero-Erausquin, R.J. McCann and M. Schmuckenschlager. A 
Riemannian interpolation inequality a la Borell, Brascamp and Lieb. 
Invent. Math. 146 (2001) 219-257. 

[30] D. Cordero-Erausquin, B. Nazaret, and C. Villani. A mass- 
transportation approach to sharp Sobolev and Gagliardo-Nirenberg in- 
equalities. Adv. Math. 182 (2004) 307-332. 



28 



[31] J. A. Cuesta-Albertos and C. Matran. Notes on the Wasserstein metric 
in Hilbert spaces. Ann. Probab. 17 (1989) 1264-1276. 

[32] J. A. Cuesta-Albertos and A. Tuero-Diaz. A characterization for the 
solution of the Monge-Kantorovich mass transference problem. Statist. 
Probab. Lett. 16 (1993) 147-152. 

[33] M.J. P. Cullen. A Mathematical Theory of Large Scale Atmo- 
sphere/Ocean Flows. Imperial College Press, London, 2006. 

[34] M.J.P Cullen and R.J. Purser. An extended Lagrangian model of semi- 
geostrophic frontogenesis. J. Atmos. Sci. 41 (1984) 1477-1497. 

[35] M.J.P Cullen and R.J. Purser. Properties of the Lagrangian semi- 
geostrophic equations. J. Atmos. Sci. 46 (1989) 2684-2697. 

[36] P. Delanoe. Classical solvability in dimension two of the second 
boundary-value problem associated with the Monge-Ampere operator. 
Ann. Inst. H. Poincare Anal. Non Lineaire 8 (1991) 443-457. 

[37] J.L. Denny. The support of discrete extremal measures with given 
marginals. Michigan Math. J. 27 (1980) 59-64. 

[38] R. Dobrushin. Definition of a system of random variables by means of 
conditional distributions (Russian). Teor. Verojatnost. i Primenen. 15 
(1970) 469-497. 

[39] R.D. Douglas. On extremal measures and subspace density. Michigan 
Math. J. 11 (1964) 243-246. 

[40] R.M. Dudley. Real Analysis and Probability. Revised reprint of the 1989 
original. Cambridge University Press, Cambridge, 2002. 



29 



[41] I. Ekeland. Existence, uniqueness and efficiency of equilibrium in he- 
donic markets with multidimensional types. Econom. Theory 42 (201) 
275-315. 

[42] L.C. Evans and W. Gangbo. Differential equations methods for the 
Monge-Kantorovich mass transfer problem. Mem. Amer. Math. Soc. 
137 (1999) 1-66. 

[43] A. Fathi and A. Figalli. Optimal transportation on non-compact man- 
ifolds. Israel J. Math. 175 (2010) 1-59. 

[44] M. Feldman and R.J. McCann. Monge's transport problem on a Rie- 
mannian manifold. Trans. Amer. Math. Soc. 354 (2002) 1667-1697. 

[45] M. Feldman and R.J. McCann. Uniqueness and transport density in 
Monge's transportation problem. Calc. Var. Partial Differential Equa- 
tions 15 (2002) 81-113. 

[46] A. Figalli. Existence, uniqueness and regularity of optimal transport 
maps. SI AM J. Math. Anal. 39 (2007) 126-137. 

[47] A. Figalli. The Monge problem on non-compact manifolds. Rend. Sem. 
Mat. Univ. Padova, 117:147-166, 2007. 

[48] A. Figalli and N. Gigli. Local seimconvexity of Kantorovich potentials 
on noncompact manifolds. To appear in ESAIM Control Optim. Calc. 
Var. 

[49] A. Figalli, Y.-H. Kim, and R.J. McCann. When is mul- 
tidimensional screening a convex program? Preprint at 
www . math . toronto . edu/mccann. 



30 



[50] A. Figalli and L. Rifford. Mass transportation on sub-Riemannian 
manifolds. Geom. Fund. Anal. 20 (2010) 124-159. 

[51] A. Figalli, F. Maggi and A. Pratelli. A mass transportation approach 
to quantitative isoperimetric inequalities. To appear in Invent. Math. 

[52] W. Gangbo. Habilitation thesis. Universite de Metz, 1995. 

[53] W. Gangbo and R.J. McCann. Optimal maps in Monge's mass trans- 
port problem. C.R. Acad. Sci. Pans Ser. I Math. 321 (1995) 1653- 
1658. 

[54] W. Gangbo and R.J. McCann. The geometry of optimal transportation. 
Acta Math. 177 (1996) 113-161. 

[55] W. Gangbo and R.J. McCann. Shape recognition via Wasserstein dis- 
tance. Quart. Appl. Math. 58 (2000) 705-737. 

[56] N. Gigli. On the inverse implication of Brenier-McCann 
theorems and the structure of (P2(M),W2). Preprint at 
http : //cvgmt . sns . it/papers/gigc/Inverse .pdf , 

[57] T. Glimm and V. Oliker. Optical design of single reflector systems 
and the Monge-Kantorovich mass transfer problem. J. Math. Sci. 117 
(2003) 4096-4108. 

[58] K. Hestir and S.C. Williams. Supports of doubly stochastic measures. 
Bernoulli 1 (1995) 217-243. 

[59] R. Jordan, D. Kinderlehrer and F. Otto. The variational formulation 
of the Fokker-Planck equation. SIAM J. Math. Anal. 29 (1998) 1-17. 



31 



[60] L. Kantorovich. On the translocation of masses. C.R. (Doklady) Acad. 
Sci. URSS (N.S.) 37 (1942) 199-201. 

[61] L. Kantorovich. On a problem of Monge (In Russian). Uspekhi Math. 
Nauk. 3 (1948) 225-226. 

[62] Y.-H. Kim and R.J. McCann. Continuity, curvature, and 
the general covariance of optimal transportation. Preprint at 
www.math.toronto.edu/mccann. To appear in J. Eur. Math. Soc. 
(JEMS). 

[63] Y.-H. Kim, R.J. McCann and M. Warren. Pseudo-Riemannian 
geometry calibrates optimal transportation. Preprint at 

www.math.toronto.edu/mccann. To appear in Math. Res. Lett. 

[64] M. Knott and OS. Smith. On the optimal mapping of distributions. 
J. Optim. Theory Appl. 43 (1984) 39-49. 

[65] T.C. Koopmans. Optimum utilization of the transportation system. 
Econometrica (Supplement) 17 (1949) 136-146. 

[66] V.L. Levin. Abstract cyclical monotonicity and Monge solutions for the 
general Monge-Kantorovich problem. Set-valued Anal. 7 (1999) 7-32. 

[67] J. Lindenstrauss. A remark on doubly stochastic measures. Amer. 
Math. Monthly 72 (1965) 379-382. 

[68] G. Loeper. On the regularity of solutions of optimal transportation 
problems. Acta Math. 202 (2009) 241-283. 

[69] V. Losert. Counterexamples to some conjectures about doubly stochas- 
tic measures. Pacific J. Math. 99 (1982) 387-397. 



32 



[70] J. Lott. Optimal transport and Perelman's reduced volume. Calc. Var. 
Partial Differential Equations 36 (2009) 49-84. 

[71] J. Lott and C. Villani. Ricci curvature for metric measure spaces via 
optimal transport. Annals Math. (2) 169 (2009) 903-991. 

[72] X.-N. Ma, N. Trudinger and X.-J. Wang. Regularity of potential func- 
tions of the optimal transportation problem. Arch. Rational Mech. 
Anal. 177 (2005) 151-183. 

[73] R.J. McCann. A Convexity Theory for Interacting Gases and Equilib- 
rium Crystals. PhD thesis, Princeton University, 1994. 

[74] R.J. McCann. A convexity principle for interacting gases. Adv. Math. 
128 (1997) 153-179. 

[75] R.J. McCann. Equilibrium shapes for planar crystals in an external 
field. Comm. Math. Phys. 195 (1998) 699-723. 

[76] R.J. McCann. Exact solutions to the transportation problem on the 
line. R. Soc. Lond. Proc. Ser. A Math. Phys. Eng. Sci., 455 (199) 
1341-1380. 

[77] R.J. McCann. Polar factorization of maps on Riemannian manifolds. 
Geom. Funct. Anal. 11 (2001) 589-608. 

[78] R.J. McCann and M. Sosio. Holder continuity of optimal multivalued 
mappings. Preprint at www. math. toronto/mccann. 

[79] R.J. McCann and P. Topping. Ricci flow, entropy, and optimal trans- 
portation. Amer. J. Math. 132 (2010) 711-730. 



33 



[80] R.J. McCann, B. Pass and M. Warren. Rectifiability of optimal trans- 
portation plans. Preprint at www. math. toronto/mccann. 

[81] G. Monge. Memoire sur la theorie des deblais et de remblais. His- 

toire de I'Academie Roy ale des Sciences de Paris, avec les Memoires 
de Mathematique et de Physique pour la meme annee, pages 666-704, 
1781. 

[82] F. Otto. The geometry of dissipative evolution equations: The porous 
medium equation. Comm. Partial Differential Equations 26 (2001) 
101-174. 

[83] F. Otto and C. Villani. Generalization of an inequality by Talagrand 
and links with the logarithmic Sobolev inequality. J. Fund. Anal. 173 
(2000) 361-400. 

[84] B. Pass. The multi-marginal optimal transportation problem. Preprint. 

[85] A.Yu. Plakhov. Exact solutions of the one-dimensional Monge- 
Kantorovich problem (Russian). Mat. Sb. 195 (2004) 57-74. 

[86] A.Yu. Plakhov. Newton's problem of the body of minimal averaged 
resistance (Russian). Mat. Sb. 195 (2004) 105-126. 

[87] J. Purser and M.J.P. Cullen. J. Atmos. Set 44 (1987) 3449-3468. 

[88] S.T. Rachev and L. Ruschendorf. Mass Transportation Problems. 
Probab. Appl. Springer- Verlag, New York, 1998. 

[89] L. Ruschendorf. Optimal solutions of multivariate coupling problems. 
Appl. Math. (Warsaw) 23 (1995) 325-338. 



34 



[90] L. Riischendorf. On c-optimal random variables. Statist. Probab. Lett. 
37 (1996) 267-270. 

[91] L. Riischendorf and S.T. Rachev. A characterization of random vari- 
ables with minimum L 2 -distance. J. Multivariate Anal. 32 (1990) 48- 
54. 

[92] T.L. Seethoff and R.C Shiflett. Doubly stochastic measures with pre- 
scribed support. Z. Wahrscheinlichkeitstheorie und Verw. Gebiete 41 

(1978) 283-288. 

[93] C. Smith and M. Knott. On the optimal transportation of distributions. 
J. Optim. Theory Appl. 52 (1987) 323-329. 

[94] C. Smith and M. Knott. On Hoeflding-Frechet bounds and cyclic mono- 
tone relations. J. Multivariate Anal. 40 (1992) 328-334. 

[95] K.-T. Sturm. On the geometry of metric measure spaces, I and II. Acta 
Math. 196 (2006) 65-177. 

[96] V.N. Sudakov. Geometric problems in the theory of infinite- 
dimensional probability distributions. Proc. Steklov Inst. Math. 141 

(1979) 1-178. 

[97] H. Tanaka. An inequality for a functional of probability distributions 
and its application to Kac's one-dimensional model of a Maxwellian 
gas. Z. Wahrscheinlichkeitstheorie und Verw. Gebiete 27 (1973) 47- 
52. 

[98] N.S. Trudinger. Isoperimetric inequalities for quermassintegrals. Ann. 
Inst. H. Poincar Anal. Non Linaire 11 (1994) 411-425. 



35 



[99] N.S. Trudinger and X.-J. Wang. On the Monge mass transfer problem. 
Calc. Var. Paritial Differential Equations 13 (2001) 19-31. 

[100] N.S. Trudinger and X.-J. Wang. On the second boundary value problem 
for Monge-Ampere type equations and optimal transportation. Ann. 
Sc. Norm. Super. Pisa CI. Sci. (5) 8 (2009) 1-32. 

[101] L. Uckelmann. Optimal couplings between onedimensional distribu- 
tions. In V. Benes and J. Stepan, editors, Distributions with given 
marginals and moment problems, pages 261-273. Kluwer Academic 
Publishers, Dordrecht, 1997. 

[102] J. Urbas. On the second boundary value problem for equations of 
Monge-Ampere type. J. Reme Angew. Math. 487 (1997) 115-124. 

[103] C. Villani. Cours d Integration et Analyse de Fourier. 



http:/ /www. umpa.ens-lyon.fr/~cvillani/Cours/iaf-2006. html, 2006. 



[104] C. Villani. Optimal Transport. Old and New, volume 338 of 

Grundlehren der Mathematischen Wissenschaften [Fundamental Prin- 
ciples of Mathematical Sciences]. Springer, New York, 2009. 

[105] J. von Neumann. A certain zero-sum two-person game equivalent to the 
optimal assignment problem. In Contributions to the theory of games. 
Volume 2, pages 5-12. Princeton University Press, Princeton NJ, 1953. 

[106] X.-J. Wang. On the design of a reflector antenna. Inverse Problems 12 
(1996) 351-375. 

[107] X.-J. Wang. On the design of a reflector antenna II. Calc. Var. Partial 
Differential Equations 20 (2004) 329-341. 



36 



